Search CORE

282 research outputs found

Robo-line storage: Low latency, high capacity storage systems over geographically distributed networks

Author: Anderson Thomas E.
Katz Randy H.
Ousterhout John K.
Patterson David A.
Publication venue
Publication date
Field of study

Rapid advances in high performance computing are making possible more complete and accurate computer-based modeling of complex physical phenomena, such as weather front interactions, dynamics of chemical reactions, numerical aerodynamic analysis of airframes, and ocean-land-atmosphere interactions. Many of these 'grand challenge' applications are as demanding of the underlying storage system, in terms of their capacity and bandwidth requirements, as they are on the computational power of the processor. A global view of the Earth's ocean chlorophyll and land vegetation requires over 2 terabytes of raw satellite image data. In this paper, we describe our planned research program in high capacity, high bandwidth storage systems. The project has four overall goals. First, we will examine new methods for high capacity storage systems, made possible by low cost, small form factor magnetic and optical tape systems. Second, access to the storage system will be low latency and high bandwidth. To achieve this, we must interleave data transfer at all levels of the storage system, including devices, controllers, servers, and communications links. Latency will be reduced by extensive caching throughout the storage hierarchy. Third, we will provide effective management of a storage hierarchy, extending the techniques already developed for the Log Structured File System. Finally, we will construct a protototype high capacity file server, suitable for use on the National Research and Education Network (NREN). Such research must be a Cornerstone of any coherent program in high performance computing and communications

NASA Technical Reports Server

On data skewness, stragglers, and MapReduce progress indicators

Author: Chambers J. M.
Dai J.
Gufler B.
Herodotou H.
Herodotou H.
Li J.
Ousterhout K.
Zaharia M.
Publication venue
Publication date: 01/01/2015
Field of study

We tackle the problem of predicting the performance of MapReduce applications, designing accurate progress indicators that keep programmers informed on the percentage of completed computation time during the execution of a job. Through extensive experiments, we show that state-of-the-art progress indicators (including the one provided by Hadoop) can be seriously harmed by data skewness, load unbalancing, and straggling tasks. This is mainly due to their implicit assumption that the running time depends linearly on the input size. We thus design a novel profile-guided progress indicator, called NearestFit, that operates without the linear hypothesis assumption and exploits a careful combination of nearest neighbor regression and statistical curve fitting techniques. Our theoretical progress model requires fine-grained profile data, that can be very difficult to manage in practice. To overcome this issue, we resort to computing accurate approximations for some of the quantities used in our model through space- and time-efficient data streaming algorithms. We implemented NearestFit on top of Hadoop 2.6.0. An extensive empirical assessment over the Amazon EC2 platform on a variety of real-world benchmarks shows that NearestFit is practical w.r.t. space and time overheads and that its accuracy is generally very good, even in scenarios where competitors incur non-negligible errors and wide prediction fluctuations. Overall, NearestFit significantly improves the current state-of-art on progress analysis for MapReduce

arXiv.org e-Print Archive

Crossref

Archivio della ricerca- LUISS Libera Università Internazionale degli Studi Sociali Guido Carli di Roma

Archivio della ricerca- Università di Roma La Sapienza

Tcl and the Tk toolkit

Author: Jones Ken
Ousterhout John K
Publication venue: Addison-Wesley
Publication date: 01/01/2009
Field of study

CERN Document Server

Diskless supercomputers: Scalable, reliable I/O for the Tera-Op technology base

Author: Katz Randy H.
Ousterhout John K.
Patterson David A.
Publication venue
Publication date
Field of study

Computing is seeing an unprecedented improvement in performance; over the last five years there has been an order-of-magnitude improvement in the speeds of workstation CPU's. At least another order of magnitude seems likely in the next five years, to machines with 500 MIPS or more. The goal of the ARPA Teraop program is to realize even larger, more powerful machines, executing as many as a trillion operations per second. Unfortunately, we have seen no comparable breakthroughs in I/O performance; the speeds of I/O devices and the hardware and software architectures for managing them have not changed substantially in many years. We have completed a program of research to demonstrate hardware and software I/O architectures capable of supporting the kinds of internetworked 'visualization' workstations and supercomputers that will appear in the mid 1990s. The project had three overall goals: high performance, high reliability, and scalable, multipurpose system

NASA Technical Reports Server

A Benchmarking Tool for Wireless Sensor Network Embedded Operating Systems

Author: Guthaus
Hujumg
Koopman
Lee
Lee
Li
Mohamed K. Watfa
Mohamed Moubarak
Moubarak
Nazhandali
Ousterhout
Park
Tanenbaum
Welsh
Wright
Yi
Publication venue: 'Academy Publisher'
Publication date
Field of study

Crossref

A Light Weight Name Service and its use within a collaborative editor

Author: A Birrell
A Tanenbaum
D Cheriton
D Corner
J Howard
J Ousterhout
J-C Lugeon
JP Deschrevel
K Sollins
RG Guy
RV Linden
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/1995
Field of study

Crossref

Python as a Federation Tool for GENESIS 3.0

Author: A Davison
A Dorval
A Gortechnikov
A Martelli
Allan D. Coop
Armando L. Rodriguez
C Günay
D Goodman
D Pecevski
E De Schutter
E De Schutter
E Nordlie
ES Raymond
F Brooks
G Ascoli
H Cornelis
H Cornelis
H Cornelis
H Cornelis
H Cornelis
Hugo Cornelis
J Bettencourt
J Fiala
James M. Bower
JG King
JK Ousterhout
JK Ousterhout
JM Eppler
K Blackwell
Kelvin E. Jones
L Huo
L Wall
M Diesmann
M Djurfeldt
M Hines
ML Hines
NH Goddard
P Gleeson
P Gleeson
R O'Hara
R Subhasis
S Crook
S Wils
U Bhalla
Publication venue: Public Library of Science
Publication date: 20/01/2012
Field of study

The GENESIS simulation platform was one of the first broad-scale modeling systems in computational biology to encourage modelers to develop and share model features and components. Supported by a large developer community, it participated in innovative simulator technologies such as benchmarking, parallelization, and declarative model specification and was the first neural simulator to define bindings for the Python scripting language. An important feature of the latest version of GENESIS is that it decomposes into self-contained software components complying with the Computational Biology Initiative federated software architecture. This architecture allows separate scripting bindings to be defined for different necessary components of the simulator, e.g., the mathematical solvers and graphical user interface. Python is a scripting language that provides rich sets of freely available open source libraries. With clean dynamic object-oriented designs, they produce highly readable code and are widely employed in specialized areas of software component integration. We employ a simplified wrapper and interface generator to examine an application programming interface and make it available to a given scripting language. This allows independent software components to be ‘glued’ together and connected to external libraries and applications from user-defined Python or Perl scripts. We illustrate our approach with three examples of Python scripting. (1) Generate and run a simple single-compartment model neuron connected to a stand-alone mathematical solver. (2) Interface a mathematical solver with GENESIS 3.0 to explore a neuron morphology from either an interactive command-line or graphical user interface. (3) Apply scripting bindings to connect the GENESIS 3.0 simulator to external graphical libraries and an open source three dimensional content creation suite that supports visualization of models based on electron microscopy and their conversion to computational models. Employed in this way, the stand-alone software components of the GENESIS 3.0 simulator provide a framework for progressive federated software development in computational neuroscience

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

The impact of operating system scheduling policies and synchronization methods of performance of parallel applications

Author: Andrew Tucker
Anoop Gupta
Baskett Forest
Carrasco F. J.
Chandra Rohit
Dan
Edler Jan
Ewing
Jeffrey
Lazowska Edward D.
Ousterhout John K.
Shigeru Urushibara
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

Receiver-driven layered multicast

Author: CASNER S.
DEERING S. E.
DEMERS A.
FENNER W.
FLOYD S.
JAFFE J. M.
Martin Vetterli
OUSTERHOUT J. K.
SCHULZRINNE H.
SPEER M. F.
Steven McCanne
TAUBMAN D.
TURLETTI T.
Van Jacobson
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

Tracking studies of the Compact Linear Collider collimation system

Author: A. Fassò
A. Latina
A. Latina
D. Schulte
D. Schulte
G. A. Blair
G. Rumolo
H. Burkhardt
H. Burkhardt
H. Grote
I. Agapov
J. Resta-López
J. K. Ousterhout
J. L. Fernandez-Hernando
J. W. Eaton
R. Assmann
S. Malton
S. Redaelli
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2009
Field of study

A collimation system performance study includes several types of computations performed by different codes. Optics calculations are performed with codes such as MADX, tracking studies including additional effects such as wakefields, halo and tail generation, and dynamical machine alignment are done with codes such as PLACET, and energy deposition can be studied with BDSIM. More detailed studies of hadron production in the beam halo interaction with collimators are better performed with GEANT4 and FLUKA. A procedure has been developed that allows one to perform a single tracking study using several codes simultaneously. In this paper we study the performance of the Compact Linear Collider collimation system using such a procedure

Royal Holloway Research Online

Crossref

Royal Holloway - Pure

Directory of Open Access Journals

CERN Document Server